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(57) Abstract 

A purified binding protein se- 
lected from the group consisting of 
insulin-like growth factor binding 
protein having an amino acid se- 
quence which is at least 85 % homol- 
ogous to the amino sequence and 
fragments thereof comprising at 
least 10 consecutive amino acids of 
the sequence that are capable of 
binding to an antibody specific for 
the protein or to an insulin-like 
growth factor is described. Recombi- 
nant DNA molecules encoding the 
binding proteins and subsequences 
thereof are also described along with 
recombinant microorganisms and 
cell lines containing the DNA mole- 
cules and methods for preparing the 
binding proteins by growing the rec- 
ombinant hosts containing the rele- 
vant DNA,molecules. Antibodies to 
the protein, identified as IGFBP-5, 
which are useful in various diagnos- 
tic applications, are also described. 
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This application is a division of U.S. 
application serial number 07/574 , 613 , filed August 28 , 
1990 (attorney docket number CHIR-007/00 US). 

10 INTRODUCTION 
Field of the Invention 

This invention relates to genetic engineering 
of naturally occurring proteins and to corresponding 
recombinant proteins, genes, and gene segments and more 

15 specifically to such proteins and genetic elements 
derived from an insulin-like growth factor-binding 
protein, to methods and compositions which employ the 
genetic elements, and to gene segments that are useful 
in diagnosis. 

20 

Background 

Insulin-like growth factors (IGFs) are low 
molecular weight polypeptide hormones with structural 
homology to proinsulin. Two different IGFs are known, 

25 namely IGF-I and IGF-II, which are mitogenic in vitro 
for a wide variety of cells in tissue culture. Both 
IGFs stimulate in vitro the growth of various tissues 
and in particular they induce collagen synthesis. IGF-I 
mediates the growth promoting effect of growth hormone 

30 in chondrogenesis and bone formation and is therefore 
essential for normal growth of an individual. This is 
demonstrated by the fact that pygmies and toy poodles 
are deficient in IGF-I but have normal growth hormone 
level in their serum. IGF-II is believed to play a key 

35 role in fetal development and nerve growth. 

In addition to their primary effect on 
skel tal tissue th y also exhibit growth-stimulating 
functions on other tissues. Wound fibroblasts are 
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known to produce IGFs which are effective in stimu- 
lating fibroblasts to grow and synthesize collagen, a 
structural protein normally required for wound healing. 
Vascularization of the wound tissue is also induced. 
5 Further, it has also been found that IGFs have an 
erythropoietin-like activity in that they induce 
hematopoiesis . 

Recent studies have also demonstrated that 
IGFs produced by certain cancer cells, e.g. breast and 

10 kidney cancer cells, auto-stimulate the proliferation 
of cancer cells and the vascular and fibrous tissues 
required to support the growth of cancer tissues. 

In addition to this, both IGFs show a 
spectrum of metabolic activities similar to those of 

15 insulin, in that they stimulate, in particular, the 
transport and metabolism of glucose. The biological 
effects of IGFs and insulin are mediated through their 
binding to specific receptors. In particular, both 
IGFs have the ability to bind to the insulin receptor 

20 with approximately 100-fold lower affinity than does 
insulin. 

Both IGFs have a concentration in blood 
approximately a hundred- fold higher than that of 
insulin. Hypoglycemia is prevented by a regulatory 

25 mechanism which involves carrier proteins present in 

blood and able to form complexes with IGFs. Thus, IGFs 
circulate in the blood in the form of a complex which 
has no insulin-like activity. Through their 
association with carrier protein (hereinafter referred 

30 to as IGF -binding proteins or IGFBPs), binding of IGFs 
to cell surface receptors is inhibited. It has also 
been demonstrated that another function of the IGF- 
binding proteins is to increase the short half-life of 
IGFs, which are subjected to rapid proteolytic 

35 degradation when present in the free form in blood. 

In accordance with the foregoing, IGFs may be 
useful in vitro to stimulate a) the growth of animals 
and humans with growth hormone deficiency, b) tissue 
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regeneration, such as erythropoiesis and 
chondrogenesis, c) wound healing and d) the functions 
of various organs e.g. liver or kidney. As a result of 
their chondrogenesis stimulating activity, IGFs are of 
5 particularly suitable use for bone formation, e.g. in 
the treatment of osteoporosis. IGFs for use in the 
above-referred treatments are advantageously 
administered to a subject in association with at least 
one IGF-binding protein. Administration of this 

10 combination rather than IGF alone has beneficial 

effects including the prevention of hypoglycemia and 
possible mitogenic effects at injection sites and the 
prolongation of IGF half -life. Further, it has been 
found that binding proteins are also useful for 

15 potentiating the erythropoietin like-effect of IGF-I. 
The binding proteins may also be useful for targeting 
IGFs to specific tissues. 

When administered alone, i.e., without any 
IGF, the binding proteins may also be therapeutically 

20 useful for blocking the adverse effects of IGFs, such 
as those which occur when IGFs are produced in excess, 
e.g. free IGFs secreted by certain cancer cells e.g. 
hormone-producing cancer cells such as breast or kidney 
cancer cells. IGF-binding-protein therapy may also 

25 prevent blindness as a secondary effect of diabetic 
proliferation retinopathy. Indeed it has been shown 
that IGFs may be one of the factors stimulating 
endothelial and fibroblast proliferation in diabetic 
retinopathy. 

30 Another therapeutic use of IGFBPs is the 

control of excessive growth in IGF-binding-protein- 
deficient subjects, since it is very likely that high 
IGF levels combined with abnormally low levels of 
binding protein are responsible for excessive growth. 

35 m recent years, three major species of IGF- 

binding proteins, different in size and other 
properties, have b en detected in the serum of rodents 
and humans. 
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The first binding protein discovered, now 
called IGFBP-3, is a glycoprotein of approximat ly 150 
kd and composed of several subunits. Its formation, in 
contrast to that of the second, smaller IGF-binding 
5 protein, is growth hormone dependent. 

The second binding protein discovered, now 
called IGFB-1, has a molecular weight of approximately 
30 - 40 kd in the human and the rat. The human IGFBP- 
1 has already been purified from various sources 

10 including amniotic fluid (Povoa, G. et al., Eur. J. 

Biochem (1984) 144 : 199, (therefore also referred to 
as amniotic fluid binding protein) , placenta 
(Koistenen, R. et al., Endocrinology (1986) 118 : 
1375), and conditioned medium of hepatoma G2 cells 

15 (Powell, D. R. et al., J. Chromatogr. (1987) 420: 163). 
The first two binding proteins have been characterized 
by their amino acid contents and their N-terminal amino 
acid sequences, and found identical or at least very 
similar. The comparison of the amino acid sequences of 

20 the IGF-binding protein isolated from hepatoma G2 cells 
(Lee, Y. L. et al., Mol. Endocrinol. (1988) 2 (5): 404) 
and the IGF-binding protein cloned from a placenta cDNA 
library (Brinkman, A. et al.. The EMBO Journal (1988) 7 
(8) : 2417) reveals 99% homology. Further, these two 

25 amino acid sequences shown with the IGF-binding protein 
as coded by a cDNA library a homology of 94% (Brewer, 
M.T. et al., Bioch. Biophys. Res. Com. (1988) 152(3): 
1289). 

In addition to the two major forms of IGF- 
30 binding proteins present in serum several other IGF- 
binding proteins have been identified in different 
human tissue extracts and cell culture media by Western 
blotting techniques and affinity labelling with [ I 125 3- 
IGF, Their molecular weights range from 15 to 150 kd 
35 and some of these proteins appear to be generated by 
proteolytic degradation of the larger IGF-binding- 
proteins. In particular, a 53 kd IGF-binding protein 
which has been purified from human serum represents a 
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subunit of the 150-kd IGFBP-1 (Baxter, R.C. Biochem 
Biophys. Res. Com. (1966 139(3) : 1256) . 

Another form of IGF-binding protein has also 
been found in the conditioned medium from rat BRL-3A 
5 cells, and has a molecular weight of 33-36 kd 

approximately. A partial amino- terminal protein 
sequence of the rat BRL-3A binding protein has been 
determined (Mottolla, C. et al . , of Biol. Chem. (1986) 
261:11180; Lyons, R.M. Smith. G.L., Mol. Cell. 

10 Endocrinol (1986) 45:263). The 33% degree of homology 
shown by the rat and human terminal sequences is not 
high enough to allow the respective binding proteins to 
be considered as equivalents. 

An additional IGFBP, now called IGFBP-2, 

15 which is related to the BRL-3A binding protein, has 
also been found and its amino acid sequence fully 
established. The amino acid sequence of IGFBP-2 is 
distinct from that of the previously known binding 
proteins . 

20 The existence of a number of different IGF- 

binding proteins indicate that these proteins have 
different functions. Since it is possible to diagnose 
disease states and to modify in various different ways 
the biological activity of IGFs using the currently 

25 known binding proteins, there is significant interest 
in the discovery of additional IGF-binding proteins 
having different biological properties. 
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SUMMARY OF THE INVENTION 

Accordingly, it is an object of the present 
5 invention to provide means for producing an IGF- 

binding protein by genetic engineering techniques, the 
binding protein having biological properties different 
from those of IGFBP-1, IGFBP-2, and IGFBP-3. 

It is further an object of the present 

10 invention to provide recombinant DNA molecules capable 
of expressing the new IGF-binding protein in order that 
the binding protein will be more readily available* 

These and other objects of the invention have 
been accomplished by providing genetic information 

15 encoding a binding protein selected from a group 

consisting of an insulin-like growth factor binding 
protein having an amino acid sequence which is at least 
85% homologous to the amino acid sequence of Figure 1 
and fragments thereof comprising at least ten (10) 

20 consecutive amino acids of said sequence, wherein said 
purified binding protein is capable of binding to an 
antibody specific for said protein or to an insulin-like 
growth factor • Recombinant DNA molecules comprising a 
nucleic acid sequence encoding the new insulin-like 

25 growth factor binding protein and its fragments are also 
a part of the present invention along with recombinant 
microorganisms and cell lines capable of expressing the 
protein . 

30 BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 is a schematic diagram showing the 
amino acid and nucleotide sequences of a clone encoding 
human IGFBP-5. 

Figure 2 is a schematic diagram comparing the 
35 amino acid sequences of a human binding protein of the 
invention, human IGFBP-5, to the known sequences of the 
three human binding proteins discussed above and to 
IGFBP-4, another newly discovered IGF-binding prot in. 
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Areas of homology can be seen in these s qu nces. These 
areas of homology are of particular interest as they 
indicated areas from which DNA probes can be obtained 
that have a high probability of success in finding 
5 related molecules. Two such homologous areas are 

indicated by brackets, although other areas of homology 
are also present. 

DESCRIPTION OF SPECIFIC EMBODIMENTS 

10 Novel compositions comprising generic 

sequences encoding IGFBP-5 and fragments derived 
therefrom are provided, together with recombinant 
proteins produced using the genomic sequences, and 
methods of using these compositions. The IGFBP-5 cDNA 

15 was initially isolated from a human osteosarcoma/XZAP 
cDNA lib ry using a two-step procedure. First, small 
fragmentb of the cDNAs encoding amino acids 1 to 15 of 
BP 5 were amplified from osteosarcoma cDNA by the 
polymerase chain reaction, gel purified, and sequenced. 

20 Second, perfect match-oligonucleotides were synthesized, 
based on the BPS nucleotide sequence between the PCR 
primers, and were used as probes to isolate cDNA 
clones. BPS cDNA clones showing the largest DNA insert 
size by agarose gel electrophoresis were sequenced. The 

25 nucleotide and encoded amino acid sequences of BPS are 
shown in Figure 1. 

Standard abbreviations for nucleotides and 
amino acids are used in these figures and elsewhere in 
this specification. 

30 A number of terms v<*ed in the art of genetic 

engineering and protein chemistry are used herein with 
the following defined meanings. 

Two nucleic acid fragments are "homologous" 
if they are capable of hybridizing to one another under 
35 hybridization conditions described in Maniatis et al . , 
op. cit. , pp. 320-323. However, by using the following 
wash conditions — 2 x SCC, 0.1% SDS, room t mperature - 
twice, 30 minut s each; then 2 x SCC, 0.1% SDS, 50°C - 
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once, 30 minutes; then 2 x SCC, room temperature twice , 
10 minutes each — homologous sequences can be 
identified that contain at most about 25-30% basepair 
mismatches. More preferably, homologous nucleic acid 
5 strands contain 15-25% basepair mismatches, even more 
preferably 5-15% basepair mismatches. These degrees of 
homology can be selected by using more stringent wash 
conditions for identification of clones from gene 
libraries (or other sources of genetic material), as is 

10 well known in the art. 

A DNA fragment is "derived from" an IGFBP-5- 
encoding DNA sequence if it has the same or substan- 
tially the same basepair sequence as a region of the 
coding sequence for the entire IGFBP-5 molecule. 

15 Substantially the same means, when referring 

to biological activities, that the activities are of 
the same type although they may differ in degree. When 
referring to amino acid sequences, substantially the 
same means that the molecules in question have similar 

20 biological properties and preferably have at least 85% 
homology in amino acid sequences. More preferably, the 
amino acid sequences are at least 90% identical. ,In 
other uses, substantially the same has its ordinary 
English language meaning. 

25 A protein is "derived from" an IGFBP-5 

molecule if it has the same or substantially the same 
amino acid sequence as a region of the IGFBP-5 
molecule . 

IGFBP-5, both glycosylated and 
30 unglycosylated, or polypeptide derivatives thereof, may 
be used for producing antibodies, either monoclonal or 
polyclonal, specific to IGFBP-5. By polypeptide 
derivatives of these IGFBPs is meant polypeptides 
differing in length from natural IGFBP-5 and containing 
35 five or more amino acids from IGFBP-5 in the same 

primary order as found in IGFBP-5 as obtained from a 
natural source. Polypeptide molecules having 
substantially the same amino acid sequence as IGFBP-5 
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replacements are contemplated. Conservative replace- 
ments are those that take place within a family of 
amino acids that are related in their side chains. 
Genetically encoded amino acids are generally divided 

5 into four families: (1) acidic - aspartate, glutamate; 
(2) basic = lysine, arginine, histidine; (3) non-polar = 
alanine, valine, leucine, isoleucine, proline, 
phenylalanine, methionine, tryptophan; and 
(4) uncharged polar = glycine, asparagine, glutamine, 

10 cystine, serine, threonine, tyrosine. Phenylalanine, 
tryptophan, and tyrosine are sometimes classified 
jointly as aromatic amino acids. For example, it is 
reasonable to expect that an isolated replacement of a 
leucine with an isoleucine or valine, an aspartate with 

15 a glutamate, a threonine with a serine, or a similar 

replacement of an amino acid with a structurally related 
amino acid will not have a major effect on the binding 
properties of the resulting molecule, especially if the 
replacement does not involve an amino acid at a binding 

20 site involved in the interaction of IGFBP-5 or its 

derivatives with an IGF molecule. Whether an amino acid 
change results in a functional peptide can readily be 
determined by assaying the specific binding properties 
of the IGFBP-5 polypeptide derivative. A binding assay 

25 is described in detail below. 

Antibodies specific for IGFBP-5 are produced 
by immunizing an appropriate vertebrate host, e.g., 
rabbit, with purified IGFBP-5 or polypeptide derivatives 
of IGFBP-5, by themselves or in conjunction with a 

30 conventional adjuvant. Usually, two or more immuni- 
zations will be involved, and blood or spleen will be 
harvested a few days after the last injection. For 
polyclonal antisera, the immunoglobulins can be pre- 
cipitated, isolated and purified by a variety of 

35 standard techniques, including affinity purification 

using IGFBP-5 attached to a solid surface, such as a gel 
or beads in an affinity column. For monoclonal 
antibodies, the splenocytes normally will be fused with 
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an immortalized lymphocyte, e.g., a myeloid cell line, 
under selective conditions for hybridoma formation. 
The hybridomas can then be cloned under limiting 
dilution conditions and their supernatants screened for 
5 antibodies having the desired specificity. Techniques 
for producing antibodies are well known in the 
literature and are exemplified by the publication 
Antibodies: * T.^hox-atorv Manual (1988) eds. Harlow and 
Lane, Cold Spring Harbor Laboratories Press, and U.S. 

10 Patent Nos. 4,381,292, 4,451,570, and 4,618,577. 

IGPBP-5 can be readily purified from blood 
and its components, such as serum and plasma and from 
cells genetically modified to produce IGFBP-5 or 
polypeptide derivatives thereof, by affinity chroma- 

15 tography using a monoclonal antibody specific for 

IGFBP-5. In addition to the use of antibody affinity 
chromatography, IGFBP-5 and polypeptide derivatives 
thereof can be purified by a variety of other widely 
known protein purification techniques (either alone or 

20 in combination) including immunoprecipitation, gel 
filtration, ion exchange chromatography, chromato- 
focusing, isoelectric focusing, selective precipita- 
tion, electrophoresis, and the like. Fractions 
isolated during purification procedures can be analyzed 

25 for the presence of IGFBP-5 or polypeptide derivatives 
of IGFBP-5 by immunoassays employing IGFBP-5 -specif ic 
antibodies or IGFBP-5-specific bioassays. Detailed 
examples are provided below. 

Isolation of nucleotide sequences encoding 

30 IGFBP-5 involves creation of either a genomic library 
prepared from cells encoding IGFBP-5 or preparation of 
a cDNA library from RNA isolated from cells expressing 
IGFBP-5. It will generally be preferable to create a 
cDNA library for isolation of lGFBP-5-coding nucleotide 

35 sequences so as to avoid any possible problems arising 
from attempts to d termin intron/ xon borders. Genetic 
libraries can be made in either eukaryotic or 
prokaryotic host cells. Widely available cloning 
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vectors such as plasmids, cosmids, phage, YACs and the 
lik can be used to generate genetic libraries suitable 
for the isolation of nucleotide sequences encoding 
IGFBP-5 or portions thereof. 

Useful methods for screening genetic 
libraries for the presence of IGFBP-5 nucleotide 
sequences include the preparation of oligonucleotide 
probes based on the N-terminus amino acid sequence 
information from purified IGFBP-5 or purified internal 
fragments of purified IGFBP-5. By employing the 
standard triplet genetic code, oligonucleotide 
sequences of about 17 base pairs or longer can be 
prepared by conventional in vitro synthesis techniques 
so as to correspond to portions of IGFBP-5 for which 
the amino acid sequence has been determined by N- 
terminus analysis. The resultant nucleic acid 
sequences can be subsequently labeled with radio- 
nuclides, enzymes, biotin, fluorescers, or the like, 
and used as probes for screening genetic libraries. 

Additional methods of interest for isolating 
IGFBP-5-encoding nucleic acid sequences include 
screening genetic libraries for the expression of 
IGFBP-5 or fragments thereof by means of IGFBP-5 
specific antibodies, either polyclonal or monoclonal. 
A particularly preferred technique involves the use of 
degenerate primers based on partial amino acid 
sequences of purified IGFBP-5 or on sequences from 
known related molecules and the polymerase chain 
reaction (PCR) to amplify gene segments between the 
primers. The gene can then be isolated using a 
specific hybridization probe based on the amplified 
gene segment, which is then analyzed for appropriate 
expression of protein. A detailed description of this 
preferred technique is set forth in the examples that 
follow. 

Nucleotide sequences encoding IGFBP-5 can be 
obtained from recombinant DNA molecules recovered from 
IGFBP-5 genetic library isolates. The nucleotide 
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sequence encoding IGFBP-5 can be obtained by sequencing 
the non-vector nucleotide sequences of these 
recombinant molecules. Nucleotide sequence information 
can be obtained by employing widely used DNA sequencing 
protocols, such as Maxim and Gilbert sequencing, dideoxy 
nucleotide sequencing, and the like. Examples of 
suitable nucleotide sequencing protocols can be found in 
Berger and Kimmel, Methods in Enzvmoloav Vol. 52 , Guide 
to Molecular Cloning Techniques , (1987) Academic Press. 
Nucleotide sequence information from several recombinant 
DNA isolates, including isolates from both cDNA and 
genomic libraries, may be combined so as to provide the 
entire amino acid coding sequence of IGFBP-5, as well as 
the nucleotide sequences of introns within the IGFBP-5 
15 genes, upstream nucleotide seguences, and downstream 
nucleotide sequences. 

Nucleotide seguences obtained from sequencing 
IGFBP-5 specific genetic library isolates are subjected 
to analysis in order to identify regions of interest in 
the IGFBP-5 genes. These regions of interest include 
open reading frames, introns, promoter seguences, 
termination sequences, and the like. Analysis of 
nucleotide sequence information is preferably performed 
by computer. Software suitable for analyzing nucleotide 
sequences for regions of interest is commercially 
available and includes, for example, DNASIS™ (LKB) . It 
is also of interest to use amino acid sequence 
information obtained from the N-terminus sequencing of 
purified IGFBP-5 when analyzing IGFBP-5 nucleotide 
30 sequence information so as to improve the accuracy of 
the nucleotide sequence analysis. 

Isolated nucleotide sequences encoding 
IGFBP-5 can be used to produce purified IGFBP-5 or 
fragments thereof by either recombinant DNA methodology 
35 or by in vitro polypeptide synthesis techniques. By 

"purified" and "isolated" is meant, wh n referring to a 
polypeptide or nucleotide sequence, that the indicated 
molecule is present in the substantial absence of other 
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biological macromolecules of the same type. The term 
"purified" as used herein preferably means at least 95% 
by weight, more preferably at least 99% by weight/ and 
most preferably at least 99.8% by weight, of biological 
5 macromolecules of the same type present (but water, 
buffers, and other small molecules, especially mole- 
cules having a molecular weight of less than 1000, can 
be present) . 

A significant advantage of producing IGFBP-5 

10 by recombinant DNA techniques rather than by isolating 
IGFBP-5 from natural sources is that equivalent 
quantities of IGFBP-5 can be produced by using less 
starting material than would be required for isolating 
the binding protein from a natural source. Producing 

15 IGFBP-5 by recombinant techniques also permits IGFBP-5 

to be isolated in the absence of some molecules normally 
present in cells that naturally produce IGFBP-5. 
Indeed, IGFBP compositions entirely free of any trace of 
human protein contaminants can readily be produced since 

20 the only human protein produced by the recombinant non- 
human host is the recombinant IGFBP. Potential viral 
agents from natural sources are also avoided. It is 
also apparent that recombinant DNA techniques can be 
used to produce IGFBP-5 polypeptide derivatives that 

25 are not found in nature, such as the variations 
described above. 

IGFBP-5 and polypeptide derivatives of 
IGFBP-5 can be expressed by recombinant techniques when 
a DNA sequence encoding the relevant molecule is 

30 functionally inserted into a vector. By "functionally 
inserted" is meant in proper reading frame and orienta- 
tion, as is well understood by those skilled in the art. 
When producing a genetic construction containing a 
complete IGFBP-5 reading frame, the preferred starting 

35 material is a cDNA library isolate encoding IGFBP-5 

rather than a genomic library isolate. Typically, the 
IGFBP-5 gene will be inserted downstream from a promoter 
and will be followed by a stop codon, although 



WO 92/03470 



PCT/US91/06140 



19 

production as a hybrid prot in followed by cleavage may 
be used, if desired. In general, host-cell-specific 
sequences improving the production yield of IGFBP-5 and 
IGFBP-5 polypeptide derivatives will be used and 
5 appropriate control sequences will be added to the 
expression vector, such as enhancer sequences, 
polyadenylation sequences, and ribosome binding sites* 

Once the appropriate coding sequence is 
isolated, it can be expressed in a variety of different 
10 expression systems. 

Mammalian Expression Systems 

A mammalian promoter is any DNA sequence 

15 capable of binding mammalian RNA polymerase and 

initiating the downstream (3 r ) transcription of a 
coding sequence (e.g. structural gene) into mRMA. A 
promoter will have a transcription initiating region, 
which is usually placed proximal to the 5' end of the 

20 coding sequence, and a TATA box, usually located 25-30 
base pairs (bp) upstream of the transcription initia- 
tion site. The TATA box is thought to direct RNA 
polymerase II to begin RNA synthesis at the correct 
site. A mammalian promoter will also contain an 

25 upstream promoter element, typically located within 100 
to 200 bp upstream of the TATA box. An upstream 
promoter element determines the rate at which 
transcription is initiated and can act in either 
orientation [Sambrook et al. (1989) "Expression of 

30 Cloned Genes in Mammalian Cells." In Molecular 
Cloning; A Laboratory Manual, 2nd ed.1 . 

Mammalian viral genes are often highly 
expressed and have a broad host range; therefore 
sequences encoding mammalian viral genes provide 

35 particularly useful promoter sequences. Examples 

include the SV40 early promoter, mouse mammary tumor 
virus LTR promoter f adenovirus major late promoter (Ad 
MLP), and herpes simplex virus promoter. In addition, 
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sequences derived from non-viral genes, such as the 
murine metallothionein gen , also provide useful 
promoter sequences. Expression may be either 
constitutive or regulated (inducible), depending on the 
5 promoter can be induced with glucocorticoid in hormone- 
responsive cells. 

The presence of an enhancer element 
(enhancer), combined with the promoter elements 
described above, will typically increase expression 

10 levels. An enhancer is a regulatory DNA sequence that 
can stimulate transcription up to 1000-fold when linked 
to homologous or heterologous promoters, with synthesis 
beginning at the normal RNA start site. Enhancers are 
also active when they are placed upstream or downstream 

15 from the transcription initiation site, in either normal 
or flipped orientation, or at a distance of more than 
1000 nucleotides from the promoter [Maniatis et al . 
(1987) Science 236: 1237; Alberts et al. (1989) Molecular 
Biology of the Cell , 2nd ed.]. Enhancer elements 

20 derived from viruses may be particularly useful, 
because they typically have a broader host range. 
Examples include the SV40 early gene enhancer [Dijkema 
et al (1985) EMBQ J. 4 :7611 and the enhancer /promoters 
derived from the long terminal repeat (LTR) of the Rous 

25 Sarcoma Virus [Gorman et al. (1982b) Proc. Natl. Acad. 

Sci. 79 ;67771 and from human cytomegalovirus [Boshart et 
al. (1985) Cell 41 ;5211. Additionally, some enhancers 
are regulatable and become active only in the presence 
of an inducer, such as a hormone or metal ion [Sassone- 

30 Corsi and Borelli (1986) Trends Genet. 2 ; 215; Maniatis 
et al. (1987) Science 236:1237]. 

A DNA molecule may be expressed intracellu- 
larly in m a mm alian cells. A promoter sequence may be 
directly linked with the DNA molecule, in which case 

35 the first amino acid at the N-terminus of the recom- 
binant protein will always be a methionine, which is 
encoded by the ATG start codon. If desired, the N- 
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signals (also called splice donor and acceptor sites) 
[see e.g., Gothing and Sambrook (1981) Nature 293 :620] . 
Introns are intervening noncoding sequences within a 
coding sequence that contain splice donor and acceptor 
5 sites. They are removed by a process called 

"splicing," following polyadenylation of the primary 
transcript [Nevins (1983) Annu. Rev. Biochem. 52 ;441; 
Green (1986) Annu. Rev. Genet. 20 ;671; Padgett et al. 
(1986) Annu. Rev. Biochem. 55 :1119; Krainer and 

10 Maniatis (1988) "KNA splicing. " In Transcription and 
splicing (ed. B.D. Hames and D.M. Glover)]. 

Typically, the above described components, 
comprising a promoter, polyadenylation signal, and 
trams cript ion termination sequence are put together 

15 into expression constructs. Enhancers, introns with 

functional splice donor and acceptor sites, and leader 
sequences may also be included in an expression 
construct, if desired. Expression constructs are often 
maintained in a replicon r such as an extrachromosomal 

20 element (e.g., plasmids) capable of stable maintenance 
in a host, such as mammalian cells or bacteria. 
Mammalian replication systems include those derived from 
animal viruses, which require trans-acting factors to 
replicate. For example, plasmids containing the 

25 replication systems of papovaviruses , such as SV40 

[Gluzman (1981) Cell 23 :1751 or polyomavirus , replicate 
to extremely high copy number in the presence of the 
appropriate viral T antigen. Additional examples of 
mammalian replicons include those derived from bovine 

30 papillomavirus and Epstein-Barr virus. Additionally, 
the replicon may have two replication systems, thus 
allowing it to be maintained, for example, in mammalian 
cells for expression and in a procaryotic host for 
cloning and amplification. Examples of such mammalian- 

35 bacteria shuttle vectors include pMT2 [Kaufman et al . 

(1989) Mol. Cell. Biol. 9 :946 and pHEBO [Shimizu et al. 
(1986) Mol. Cell. Biol. 6 :1074]. 
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Baculovirus Expression Systems 

A baculovirus promoter is any DNA sequence 
capable of binding a baculovirus RNA polymerase and 
5 initiating the downstream (3') transcription of a 

coding sequence (e.g. structural gene) into mRNA. A 
promoter will have a transcription initiation region 
which is usually placed proximal to the 5' end of the 
coding sequence. This transcription initiation region 
10 typically includes an RNA polymerase binding site and a 
transcription initiation site. A baculovirus promoter 
may also have a second domain called an enhancer, which, 
if present, is usually distal to the structural gene. 
Expression may be either regulated or constitutive. 
15 Sequences encoding genes abundantly 

transcribed at late times in the infection cycle 
provide particularly useful promoter sequences. 
Examples include sequences derived from the polyhedrin 
[Friesen et al. (1986) "The Regulation of Baculovirus 
20 Gene Expression," in: The Molecular Biology of 

Baculoviruses (ed. Walter Doerf ler) ; E.P.O. Pub. Nos. 
127,839 and 155,476] and plO [Vlak et al. (1988) J* Gen. 
Virol. 69 ; 765] genes. 

A DNA molecule may be expressed intracellu- 
25 larly. A promoter sequence may be directly linked with 
the DNA molecule, in which the case the first amino acid 
at the N- terminus of the recombinant protein will always 
be a methionine, which is encoded by the ATG start 
codon. If desired, methionine at the N-terminus may be 
30 cleaved from the protein by in vitro incubation with 
cyanogen bromide. 

Fusion proteins provide an alternative to 
direct expression. Typically, a DNA sequence encoding 
the N- terminal portion of an endogenous yeast protein, 
35 or other stable protein, is fused to the 5' end of 

heterologous coding sequences. Upon expression, this 
construct will provide a fusion of the two amino acid 
sequ nces. For example, the N-terminus of the 
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polyhedrin gene may be linked at the 5' terminus of a 
foreign gene and expressed in yeast. The DNA sequence 
at the junction of the two amino acid sequences may or 
may not encode a cleavable site. See e.g., Luckow et 
5 al. (1988) Bio /technology 6 :47. 

Alternatively, foreign proteins can also be 
secreted from the cell by creating chimeric DNA 
molecules that encode a fusion protein comprised of a 
leader sequence fragment that provides for secretion of 

10 the foreign protein in insects. The leader sequence 

fragment typically encodes a signal peptide comprised of 
hydrophobic amino acids which direct the secretion of 
the protein from the cell. 

DNA encoding suitable signal sequences can be 

15 derived from genes for secreted insect or baculovirus 
proteins, such as the baculovirus polyhedrin gene 
[Carbonell et al. (1988) Gene 73 :409] . Alternatively, 
leaders of non-baculovirus origin, such as those derived 
from genes encoding human alpha-interf eron [Maeda et al . 

20 (1985) Nature 315 :5921, human gastrin-releasing peptide 
[Lebacq-Verheyden et al. (1988) Molec. Cell. Biol. 
8:3129], human IL-2 [Smith et al. (1985) Proc. Natl. 
Acad. Sci. USA 82 :84041, mouse IL-3 [Miyajima et al. 
(1987) Gene 58 :2731, and human glucocerebrosidase 

25 [Martin et al. (1988) DNA 7 :991 also provide for 
secretion in insects. 

Typically, transcription termination 
sequences recognized by insects are regulatory regions 
located 3' to the translation stop codon, and thus 

30 together with the promoter flank the coding sequence. 
These sequences direct the transcription of an mRNA 
which can be translated into the polypeptide encoded by 
the DNA. Examples include transcription termination 
sequences derived from the polyhedrin gene [Miller et 

35 al. (1988) Ann. Rev. Microbiol. 42 :1771. 

Prior to insertion of the foreign gene into 
the baculovirus genome, the above described components, 
comprising a promoter, leader (if desired), coding 
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sequence of interest/ and transcription termination 
sequence, are typically put together into an 
intermediate transplacement construct. Intermediate 
transplacement constructions are often maintained in a 
5 replicon, such as an extrachromosomal element (e.g., 

plasmids) capable of stable maintenance in a host, such 
as bacteria. The replicon will have a replication 
system, thus allowing it to be maintained in a 
prokaryotic host for cloning and amplification. The 
10 promoter and transcription termination sequence of the 

construct will typically comprise a 2.5kb section of the 
baculovirus genome for integration of the foreign gene 
into the baculovirus genome by double crossover 
recombination events, producing a baculovirus expression 
15 vector [Miller et al. (1989) Bloessays 4 s 911 . The 

baculovirus expression vector is typically packaged into 
an infectious recombinant baculovirus. 

When using baculovirus expression vectors, 
selectable markers are, such as antibiotic resistance 
20 genes, are generally not used. Selection is typically 
by visual inspection for occlusion bodies. Examples 
are given elsewhere in this specification of the use of 
selectable markers. 

Recombinant baculovirus expression vectors 
25 have been developed for infection into several insect 
cells. For example, recombinant baculoviruses have 
been developed for inter alia: Aedes aegypti, 
Autographa californica, Bombyx mori, Drosophila 
melanogaster , Helxothis zea, Spodoptera frugiperda, and 
30 Trichoplusxa ni [P.C.T. WO 89/046699; Carbonell et al. 
(1985) J. Virol. 56 ; 153: Smith et al. (1983) Mol. 
Cell. Biol. 3 :2156; Wright (1986) Nature 321 :718: See 
generally, Fraser et al. (1989) In Vitro Cell. Dev. 
Biol. 25 :2251. 

35 Methods of introducing exogenous DNA into 

insect hosts are well-known in the art, and typically 
include either the transfection of host insect cells 
with DNA or the infection of insect cells or live 
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insects, usually larvae, with virus. Transfection 
procedures are based on the calcium phosphate procedure 
originally developed for mammalian cells [Graham et al . 
(1973) Virology 52x456]. DNA transfection and viral 
5 infection procedures usually vary with the insect genus 
to be transformed. See e.g. Autograph [Cars tens et al. 
(1980) Virology 101:311], Heliothis (virescens) [P.C.T. 
Pub. No, WO88/02030], Spodoptera [Kang (1988) 
"Baculovirus Vectors for Expression of Foreign Genes," 
10 ins Advances in Virus Research , vol. 35]. 

Bacterial Expression Systems 

A bacterial promoter is any DNA sequence 
capable of binding bacterial RNA polymerase and 

15 initiating the downstream (3 H ) transcription of a 

coding sequence (e.g., structural gene) into mRNA. A 
promoter will have a transcription initiation region 
which is usually placed proximal to the 5' end of the 
coding sequence. This transcription initiation region 

20 typically includes an RNA polymerase binding site and a 
transcription initiation site. A bacterial promoter 
may also have a second domain called an operator, that 
may overlap an adjacent RNA polymerase binding site at 
which RNA synthesis begins. The operator permits 

25 negative regulated (inducible) transcription, as a gene 
repressor protein may bind the operator and thereby 
inhibit transcription of a specific gene. Constitutive 
expression may occur in the absence of negative 
regulatory elements, such as the operator. In addition, 

30 positive regulation may be achieved by a gene activator 
protein binding sequence, which, if present is usually 
proximal (5') to the RNA polymerase binding sequence. 
An example of a gene activator protein is the catabolite 
activator protein (CAP), which helps initiate 

35 transcription of the lac operon in Escherichia coli (E. 
coli) [Raibaud et al. (1984) Annu. Rev. Genet. 18:173]. 
Regulated expression may therefore be either positive or 
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negative, thereby either enhancing or reducing 

transcription . 

Sequences encoding metabolic pathway enzymes 
provide particularly useful promoter sequences. 
5 Examples include promoter sequences derived from sugar 
metabolizing enzymes, such as galactose, lactose (lac) 
[Chang et al. (1977) Nature 198:1056], and maltose. 
Additional examples include promoter sequences derived 
from biosynthetic enzymes such as tryptophan (trp_) 

10 [Goeddel et al. (1980) wuc. Acids Res. 8:4057; 

Yelverton et al. (1981) Wucl. Acids Res. 9:731; U.S. 
Patent No. 4,738,921; E.P.O. Pub. Nos. 36,776 and 
121,775]. The 7-lactamase (bla) promoter system 
[Weissmann (1981) "The cloning of interferon and other 

15 mistakes." In Interferon 3 (ed. I. Gresser)], 

bacteriophage lambda PL [Shimatake et al. (19B1) Nature 
292:128] and T5 [U.S. Patent No. 4,689,406] promoter 
systems also provide useful promoter sequences. 

In addition, synthetic promoters which do not 

20 occur in nature also function as bacterial promoters. 

For example, transcription activation sequences of one 
bacterial or bacteriophage promoter may be joined with 
the operon sequences of another bacterial or 
bacteriophage promoter, creating a synthetic hybrid 

25 promoter [U.S. Patent No. 4,551,433]. For example, the 
tac promoter is a hybrid trp-lac promoter comprised of 
both trp promoter and lac operon sequences that is 
regulated by the lac repressor [Amann et al. (1983) Gene 
25:167; de Boer et al. (1983) Proc. Nat l. Acad. Sci. 

30 80:21]. Furthermore, a bacterial promoter can include 
naturally occurring promoters of non-bacterial origin 
that have the ability to bind bacterial RNA polymerase 
and initiate transcription. A naturally occurring 
promoter of non-bacterial origin can also be coupled 

35 with a compatible RNA polymerase to produce high levels 
of expression of some genes in prokaryotes. The 
bact riophage T7 RNA polymerase/promoter system is an 
example of a coupled promoter system [Studier et al. 
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(1986) J, Mol. Biol. 189:113; Tabor et al. (1985) Proc 
Natl. Acad. Sci. 82:1074]. In addition, a hybrid 
promoter can also be comprised of a bacteriophage 
promoter and an coli operator region (E.F.O. Pub. 
5 No. 267,851). 

In addition to a functioning promoter 
sequence, an efficient ribosome binding site is also 
useful for the expression of foreign genes in 
prokaryotes. In E. coli , the ribosome binding site is 

10 called the Shine-Dalgarno (SD) sequence and includes an 
initiation codon (ATG) and a sequence 3-9 nucleotides in 
length located 3-11 nucleotides upstream of the 
initiation codon [Shine et al. (1975) Nature 254:34]. 
The SD sequence is thought to promote binding of mRNA to 

15 the ribosome by the pairing of bases between the SD 

sequence and the 3' and of B. coli 16S rRNA [Steitz et 
al . (1979) "Genetic signals and nucleotide sequences in 
messenger RNA. M In Biological Regulation and 
Development; Gene Expression (ed. R.F. Goldberger) ] . To 

20 express eukaryotic genes and prokaryotic genes with weak 
ribosome-binding site [Sambrook et al. (1989) 
"Expression of cloned genes in Escherichia coli." In 
Molecular Cloning: A Laboratory Manual ! . 

A DNA molecule may be expressed intracellu- 

25 larly. A promoter sequence may be directly linked with 
the DNA molecule, in which case the first amino acid at 
the N-terminus will always be a methionine, which is 
encoded by the ATG start codon. If desired, methionine 
at the N-terminus may be cleaved from the protein by in 

30 vitro incubation with cyanogen bromide or by either in 
vivo on in vitro incubation with a bacterial methionine 
N-terminal peptidase (E.P.O. Pub. No. 219,237). 

Fusion proteins provide an alternative to 
direct expression. Typically, a DNA sequence encoding 

35 the N-terminal portion of an endogenous bacterial 

protein, or oth r stable protein, is fused to the 5' 
end of heterologous coding sequences. Upon expression, 
this construct will provide a fusion of the two amino 
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acid sequences. For xampl , the bacteriophage lambda 
cell gene can be linked at the 5' terminus of a foreign 
gene and expressed in bacteria. The resulting fusion 
protein preferably retains a site for a processing 
5 enzyme (factor Xa) to cleave the bacteriophage protein 
from the foreign gene [Nagai et al, (1984) Nature 
309:810]. Fusion proteins can also be made with 
sequences from the lac Z [Jia et al. (1987) Gene 60:197] , 
trpE [Allen et al. (1987) J. Biotechnol. 5:93; Makoff et 
10 al. (1989) J. Gen. Microbiol. 135:11], and Chey [E.P.O. 
Pub. No. 324,647] genes. The DNA sequence at the 
junction of the two amino acid sequences may or may not 
encode a cleavable site. Another example is a ubiguitin 
fusion protein. Such a fusion protein is made with the 
15 ubiquitin region that preferably retains a site for a 
processing enzyme (e.g. ubiquitin specific processing- 
protease) to cleave the ubiquitin from the foreign 
protein. Through this method, native foreign protein 
can be isolated [Miller et al. (1989) Bio/Technology 
20 7:698]. 

Alternatively, foreign proteins can also be 
secreted from the cell by creating chimeric DNA 
molecules that encode a fusion protein comprised of a 
signal peptide sequence fragment that provides for 

25 secretion of the foreign protein in bacteria [U.S. 

Patent No. 4,336,336]. The signal sequence fragment 
typically encodes a signal peptide comprised of 
hydrophobic amino acids which direct the secretion of 
the protein from the cell. The protein is either 

30 secreted into the growth media (gram-positive bacteria) 
or into the periplasmic space , located between the inner 
and outer membrane of the cell (gram-negative bacteria). 
Preferably there are processing sites, which can be 
cleaved either in vivo or in vitro encoded between the 

35 signal peptide fragment and the foreign gene. 

DNA encoding suitable signal sequences can be 
derived from genes for secreted bacterial proteins r 
such as the E. coli outer membrane protein gene ( ompA ) 
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[Masui et al. (1983) , in: Experimental Manipulation of 
Gene Expr ssion ; Ghrayeb et al. (1984) EMBO J. 3t 2437 ] 
and the E . coli alkaline phosphatase signal sequence 
(phoA) [Oka et al. (1985) Proc. Natl. Acad. Sci. 
5 82:7212]. As an additional example, the signal 
sequence of the alpha-amylase gene from various 
Bacillus strains can be used to secrete heterologous 
proteins from B. subtilis [Palva et al. (1982) Proc. 
Natl. Acad. Sci. USA 79 :5582; E.P.O. Pub. No. 244,042]. 

10 Typically, transcription termination 

sequences recognized by bacteria are regulatory regions 
located 3' to the translation stop codon, and thus 
together with the promoter flank the coding sequence. 
These sequences direct the transcription of an mRNA 

15 which can be translated into the polypeptide encoded by 
the DNA. Transcription termination sequences frequently 
include DNA sequences of about 50 nucleotides capable of 
forming stem loop structures that aid in terminating 
transcription. Examples include transcription 

20 termination sequences derived from genes with strong 
promoters, such as the trp gene in E. coli as well as 
other biosynthetic genes. 

Typically, the above described components, 
comprising a promoter, signal sequence (if desired), 

25 coding sequence of interest, and transcription 

termination sequence, are put together into expression 
constructs. Expression constructs are often maintained 
in a replicon, such as an extrachromosomal element 
(e.g., plasmids) capable of stable maintenance in a 

30 host, such as bacteria. The replicon will have a 

replication system, thus allowing it to be maintained 
in a procaryotic host either for expression or for 
cloning and amplification. In addition, a replicon may 
be either a high or low copy number plasmid. A high 

35 copy number plasmid will generally have a copy number 
ranging from about 5 to about 200, and typically about 
10 to about 150. A host containing a high copy number 
plasmid will preferably contain at least about 10, and 
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more preferably at least about 20 plasmids. Either a 
high or low copy number v ctor may be select d, 
depending upon the effect of the vector and the foreign 
protein on the host. 
5 Alternatively , the expression constructs can 

be integrated into the bacterial genome with an 
integrating vector. Integrating vectors typically 
contain at least one sequence homologous to the 
bacterial chromosome that allows the vector to 
10 integrate. Integrations appear to result from 

recombinations between homologous DNA in the vector and 
the bacterial chromosome. For example, integrating 
vectors constructed with DNA from various Bacillus 
strains integrate into the Bacillus chromosome (E.P.O. 
15 Pub. No. 127 , 328). Integrating vectors may also be 
comprised of bacteriophage or transposon sequences. 

Typically, extrachromosomal and integrating 
expression constructs may contain selectable markers to 
allow for the selection of bacterial strains that have 
20 been transformed. Selectable markers can be expressed 

in the bacterial host and may include genes which render 
bacteria resistant to drugs such as ampicillin, 
chloramphenicol, erythromycin, kanamycin (neomycin), and 
tetracycline [Davies et al. (1978) Annu . Rev . Microbiol . 
25 32:469]. Selectable markers may also include 

bio synthetic genes, such as those in the histidine, 
tryptophan, and leucine biosynthetic pathways. 

Alternatively, some of the above described 
components can be put together in transformation 
30 vectors. Transformation vectors are typically 
comprised of a selectable market that is either 
maintained in a replicon or developed into an 
integrating vector, as described above. 

Expression and transformation vectors, either 
35 extra-chromosomal replicons or integrating vectors, have 
been developed for transformation into many bacteria. 
For example, expression vectors have been developed for, 
inter alia , the following bacteria: Bacillus subtilis 
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Streptococcal Genetics (ed. J. Ferretti and R* Curtiss 
III); Perry et al. (1981) Infec. Immun. 32:1295; Powell 
et al. (1988) Appl. Environ. Microbiol. 54:655; Somkuti 
et al . (1987) Proc. 4th Evr. Cong. Biotechnology 1:412, 
5 Streptococcus ] . 

Description: Yeast Expression System 

A yeast promoter is any DNA sequence capable 
of binding yeast RNA polymerase and initiating the 
10 downstream (3') transcription of a coding sequence 

(e.g. structural gene) into mRKA. A promoter will have 
a transcription initiation region which is usually 
placed proximal to the 5' end of the coding sequence. 
This transcription initiation region typically includes 
15 an RNA polymerase binding site (the "TATA Box") and a 
transcription initiation site. A yeast promoter may 
also have a second domain called an upstream activator 
sequence (UAS), which, if present, is usually distal to 
the structural gene. The UAS permits regulated 
20 (inducible) expression. Constitutive expression occurs 
in the absence of a UAS. Regulated expression may be 
either positive or negative, thereby either enhancing or 
reducing transcription. 

Yeast is a fermenting organism with an active 
25 metabolic pathway, therefore sequences encoding enzymes 
in the metabolic pathway provide particularly useful 
promoter sequences. Examples include alcohol 
dehydrogenase (ADH)(E.P.O. Pub. No. 284044), enolase, 
glucokinase, glucose-6-phosphate isomerase, 
30 glyceraldehyde-3-phosphate-dehydrogenase (GAP or 
GAPDH) , hexokinase, phosphofructokinase, 3- 
phosphoglycerate mutase, and pyruvate kinase 
(PyK)(E.P.O. Pub. No. 329203). The yeast PH05 gene, 
encoding acid phosphatase, also provides useful 
35 promoter sequences [Myanohara et al. (1983) Proc. Natl. 
Acad. Sci. USA 80:1]. 

In addition, synthetic promoters which do not 
occur in nature also function as yeast promoters. For 



WO 92/03470 



PCT/US91/06140 



34 

example, DAS sequences of one yeast promoter may be 
joined with the transcription activation region of 
another yeast promoter, creating a synthetic hybrid 
promoter. Examples of such hybrid promoters include 
5 the ADH regulatory sequence linked to the GAP 

transcription activation region (U.S. Patent Nos. 
4,876,197 and 4,880,734). Other examples of hybrid 
promoters include promoters which consist of the 
regulatory sequences of either the ADH2 , GAL4, GAL10 , 

10 OR PH05 genes, combined with the transcriptional 

activation region of a glycolytic enzyme gene such as 
GAP or PyK(E.P.O. Pub. No. 164556). Furthermore, a 
yeast promoter can include naturally occurring 
promoters of non-yeast origin that have the ability to 

15 bind yeast RNA polymerase and initiate transcription. 

Examples of such promoters include, inter alia , [Cohen 
et al. (1980) Proc. Natl. Acad. Sci. USA 77; 1078; 
Henikoff et al. (1981) Nature 283 :835; Hollenberg et al. 
(1981) Curr. Topics Microbiol. Immunol. 96 : 119; 

20 Hollenberg et al. (1979) "The Expression of Bacterial 
Antibiotic Resistance Genes i the Yeast Saccharomyces 
cerevisiae," in: Plasmids of Medical, Environmental and 
Commercial Importance (eds. K.N. Timmis and A. Puhler); 
Mercerau-Puigalon et al. (1980) Gene 11:163; Panthier et 

25 al. (1980) Curr. Genet. 2:109:1. 

A DNA molecule may be expressed intracellu- 
larly in yeast. A promoter sequence may be directly 
linked with the DNA molecule, in which case the first 
amino acid at the N-terminus of the recombinant protein 

30 will always be a methionine, which is encoded by the ATG 
start codon. If desired, methionine at the N-terminus 
may be cleaved from the protein by in vitro incubation 
with cyanogen bromide. 

Fusion proteins provide an alternative to 

35 direct expression. Typically, a DNA sequence encoding 
the N-terminal portion of an endogenous yeast protein, 
or other stable protein, is fused to the 5' end of 
heterologous coding sequences. Upon expression, this 



35 

construct will provide a fusion of the two amino acid 
sequences. For example , the yeast or human superoxide 
dismutase (SOD) gene r can be linked at the 5' terminus 
of a foreign gene and expressed in yeast. The DNA 
sequence at the junction of the two amino acid 
sequences may or may not encode a cleavable site. See 
e.g., E.P.O. Pub. No* 196056. Another example is a 
ubiquitin fusion protein. Such a fusion protein is 
made with the ubiquitin region that preferably retains 
a site for a processing enzyme (e.g. ubiquitin- 
specific processing protease) to cleave the ubiquitin 
from the foreign protein. Through this method, 
therefore, native foreign protein can be isolated 
(P.C.T. WO 88/024066; commonly owned U.S. Patent 
Application Serial No. 359, 599 , filed 7 August 1989, 
the disclosure of which is Incorporated herein by 
reference) . This system is the currently preferred 
system for producing IGFBP-5. 

Alternatively, foreign proteins can also be 
secreted from the cell into the growth media by 
creating chimeric DNA molecules that encode a fusion 
protein comprised of a leader sequence fragment that 
provide for secretion in yeast of the foreign protein. 
Preferably, there are processing sites encoded between 
the leader fragment and the foreign gene that can be 
cleaved either in vivo or in vitro . The leader sequence 
fragment typically encodes a signal peptide comprised of 
hydrophobic amino acids which direct the secretion of 
the protein from the cell. 

DNA encoding suitable signal sequences can be 
derived from genes for secreted yeast proteins, such as 
the yeast invertase gene (E.P.O. Pub. No. 12873; J. P.O. 
Pub. No. 62,096,086) and the A-f actor gene (U.S. Patent 
No. 4,588,684). Alternatively, leaders of non-yeast 
origin, such as an interferon leader, exist that also 
provide for secretion in yeast (E.P.O. Pub. No. 60057). 

A preferred class of secretion leaders are 
those that employ a fragment of the yeast alpha-factor 
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will generally have a copy number ranging from about 5 
to about 200, and typically about 10 to about 150. A 
host containing a high copy number plasmid will 
preferably have at least about 10, and more preferably 
5 at least about 20. Enter a high or low copy number 

vector may be selected, depending upon the effect of the 
vector and the foreign protein on the host. See e.g.. 

Brake et al. , supra . 

Alternatively, the expression constructs can 

10 be integrated into the yeast genome with an integrating 
vector. Integrating vectors typically contain at least 
one sequence homologous to a yeast chromosome that 
allows the vector to integrate, and preferably contain 
two homologous sequences flanking the expression 

15 construct. Integrations appear to result from 

recombinations between homologous DNA in the vector and 
the yeast chromosome [Orr-Weaver et al. (1983) Methods 
in Enzvmol. 101:228-245]. An integrating vector may be 
directed to a specific locus in yeast by selecting the 

20 appropriate homologous sequence for inclusion in the 
vector. See Orr-Weaver et al., supra . One or more 
expression ccnstruct may integrate, possibly affecting 
levels of recombinant protein produced [Rine et al. 
p9ft.-n Proc. Natl. Acad. Sci. USA B0;6750]. The 

25 chromosomal sequences included in the vector can occur 
either as a single segment in the vector, which results 
in the integration of the entire vector, or two segments 
homologous to adjacent segments in the chromosome and 
flanking the expression construct in the vector, which 

30 can result in the stable integration of only the 
expression construct. 

Typically, extrachromosomal and integrating 
expression constructs may contain selectable markers to 
allow for the selection of yeast strains that have been 

35 transformed. Selectable markers may include 

biosynthetic genes that can be expressed in the yeast 
host, such as ADE2, HIS4, LEU2, TRP1 , and ALG7, and the 
G418 resistance gene, which confer resistance in yeast 
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cells to tunicamycin and G418 r respectively. In 
addition, a suitable selectable marker may also provide 
yeast with the ability to grow in the presence of toxic 
compounds , such as metal* For example , the presence of 
CUP1 allows yeast to grow in the presence of copper ions 
[Butt et al. (1987) Microbiol, Rev, 51:351]. 

Alternatively, some of the above described 
components can be put together into transformation 
vectors . Transformation vectors are typically 
comprised of a selectable marker that is either 
maintained in a replicon or developed into an 
integrating vector, as described above* 

Expression and transformation vectors, either 
extrachromosomal replicons or integrating vectors, have 
been developed for transformation into many yeasts. For 
example, expression vectors have been developed for, 
inter alia, the following yeasts: Candida albicans 
[Kurtz, et al. (198S) Mol. Cell. Biol. 6:1421, Candida 
maltosa [Kunze, et al. 91985) J. Basic Microbiol. 
25:141]. Hansenula polymorpha [Gleeson, et al. (1986) 
J. Gen. Microbiol. 132 :3459; Roggenkamp et al. (1986) 
Mol. Gen. Genet. 202 :3021, JCluyveromyces fragilis [Das, 
et al. (1984) J. Bacterid. 158 :11651, JTluyveromyces 
lactis [De Louvencourt et al. (1983) J. Bacterid. 
154 :737; Van den Berg et al. (1990) Bio/Technology 
1:135], Pichia guillerimondii [Kunze et al. (1985) J. 
Basic Microbiol. 25:141], Pichia pastoris [Cregg, et al. 
(1985) Mol. Cell. Biol. 5:3376; U.S. Patent Nos. 
4,837,148 and 4,929,555], Saccharomyces cerevisiae 
[Hinnen et al. (1978) Proc. Natl. Acad. Sci. USA 
75:1929; Ito et al. (1983) J. Bacteriol. 153 :1631, 
Schizosaccharomyces pombe [Beach and Nurse (1981) 
Nature 300:706], and Yarrowia lipolytica [Davidow, et 
al. (1985) Curr. Genet. 10:380471 Gaillardin, et al. 
(1985) Curr. Genet. 10:491. 

Methods of introducing exogenous DNA into 
yeast hosts are well-known in the art, and typically 
include either the transformation of spheroplasts or of 
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intact yeast cells treated with alkali cations. 
Transformation procedures usually vary with the yeast 
species to be transformed. See e.g., [Kurtz et al. 
(1986) Mol, Cell. Biol. 6:142; Kunze et al. 91985) J± 
5 Basic Microbiol . 25:141; Candida]} [Gleeson et al. 

91986) J* Gen* Microbiol. 132:3459; Roggenkamp et al. 
(1986) Mol. Gen. Genet. 202:302; Hansenula] ; [Das et 
al. (1984) J. Bacteriol. 158:1165; De Louvencourt et 
al. (1983) J. Bacteriol. 154:1165; Van den Berg et al. 

10 (1990) Bio/Technology 8:135; JCluyveromyces] ; [Cregg et 
al. (1985) Mol. Cell. Biol. 5:3376; Kunze et al. (1985) 
J. Basic Microbiol. 25:141; U.S. Patent Nos. 4 r 837,148 
and 4,929,555; Pichia}; [Hinnen et al. (1978) Proc. 
Natl. Acad. Sci. USA 75;1929; Ito et al. (1983) 

15 Bacteriol. 153:163 Saccharomyces]; [Beach and Nurse 

(1981) Nature 300 :706; Schizosaccharomyces] ; [Davidow et 
al. (1985) Curr. Genet. 10:39; Gaillardin et al. (1985) 
Curr. Genet. 10:49; Yarrowia}. 

20 Diacmostic Methods using Antigens 

The compositions comprising antigens of the 
invention, as well as the genetic material, can be used 
in diagnostic assays. Among the biologically useful 
information that can be obtained is excessive binding 

25 protein levels due to the presence of tumors, that 

result in increased production of either IGF or one of 
the IGFBP binding proteins (since the binding proteins 
are produced in the presence of excess IGF) . 
Additionally, a number of known disorders can be 

30 related to IGF concentrations. For example, some types 
of osteoporosis is related to IGF levels. 
Additionally, the binding proteins can be used in the 
identification, production, and purification of 
recombinantly produced IGFs. Methods for detecting the 

35 presence of IGFBP-5 comprise analyzing a biological 

sample such as a blood sample, cerebrospinal fluid, or 
tumor or bone tissue. 
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Typically, methods for detecting. analytes 
such as binding proteins of the invention are based on 
immunoassays. Such techniques are well known and need 
not be described here in detail. Examples include both 
heterogeneous and homogeneous immunoassay techniques. 
Both techniques are based on the formation of an 
immunological complex between the binding protein and a 
corresponding specific antibody. Heterogeneous assays 
for IGFBP-5 typically use a specific monoclonal or 
polyclonal antibody bound to a solid surface. Sandwich 
assays are increasingly popular. Homogeneous assays, 
which are carried out in solution without the presence 
of a solid phase, can also be used, for example by 
determining the difference in enzyme activity brought on 
by binding of free antibody to an enzyme-antigen 
conjugate. A number of suitable assays are disclosed in 
U.S. Patent Nos. 3,817,837, 4,006,360, 3,996,345. 

The solid surface reagent in the above assay 
is prepared by known techniques for attaching protein 
material to solid support material, such as polymeric 
beads, dip sticks, or filter material. These attachment 
methods generally include non-specific adsorption of the 
protein to the support or covalent attachment of the 
protein, typically through a free amine group, to a 
chemically reactive group on the solid support, such as 
an activate carboxyl, hydroxyl, or aldehyde group. 

In a second diagnostic configuration, known 
as a homogeneous assay, antibody binding to an analyte 
produces some change in the reaction medium which can 
be directly detected in the medium. Known general types 
of homogeneous assays proposed heretofore include (a) 
spin-labeled reporters, where antibody binding to the 
antigen is detected by a change in reported mobility 
(broadening of the spin splitting peaks), (b) 
fluorescent reporters, where binding is detected by a 
change in fluorescence efficiency, (c) enzyme reporters, 
where antibody binding effects enzyme/substrate 
interactions, and (d) liposome-bound reporters, where 



WO 92/03470 



PCT/US91/06140 



41 

binding leads to liposome lysis and release of 
encapsulated reporter. The adaptation of these methods 
to the protein antigen of the present invention follows 
conventional methods for preparing homogeneous assay 
5 reagents • 

Diagnostic Applications using Genetic Probes 

The genetic material of the invention can it- 
self be used in numerous assays as probes for genetic 

10 material present in naturally occurring materials. The 
analyte can be a nucleotide sequence which hybridizes 
with a probe comprising a sequence of (usually) at least 
about 16 consecutive nucleotides/ usually 30 to 200 
nucleotides, up to substantially the full sequence of 

15 the sequences shown above (cDNA sequences). The analyte 
can be RNA or cDNA. The sample is typically a as 
described in the previous section. A positive result is 
generally characterised as identifying a genetic 
material comprising a sequence at least about 70% 

20 homologous to a sequence of at least 12 consecutive 

nucleotides of the sequences given herein, usually at 
least about 80% homologous to at least about 60 
consecutive nucleotides within the sequences, and may 
comprise a sequence substantially homologous to the 

25 full-length sequences. In order to detect an analyte, 
where the analyte hybridizes to a probe, the probe may 
contain a detectable label . Probes that are 
particularly useful for detecting binding proteins are 
based on conserved regions of these proteins, 

30 particularly from amino acids PNCD and amino acids CWCV 
of BPS as shown in brackets in Figure 1. These amino 
acids are highly conserved in all of the related IGF 
binding proteins. Only IGFBP-1 has a difference, a *i 
for a D at position 191. 

35 One method for amplification of target 

nucleic acids, for later analysis by hybridization 
assays, is known as the polymerase chain reaction or 
PCR technique. The PGR technique can be applied to 
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detecting IGFBP-5 of the invention in suspect d samples 
using oligonucleotide primers spaced apart from each 
other and based on the genetic sequence set forth 
herein. The primers are complementary to opposite 
5 strands of a double stranded DNA molecule and are 

typically separated by from about 50 to 450 nt or more 
(usually not more than 2000 nt) . This method entails 
preparing the specific oligonucleotide primers and then 
repeated cycles of target DNA denaturation, primer 

10 binding r and extension with a DNA polymerase to obtain 

DNA fragments of the expected length based on the primer 
spacing. Extension products generated from one primer 
serve as additional target sequences for the other 
primer. The degree of amplification of a target sequence 

15 is controlled by the number of cycles that are performed 
and is theoretically calculated by the simple formula 2n 
where n is the number of cycles. Given that the average 
efficiency per cycle ranges from about 65% to 855 , 25 
cycles produce from 0.3 to 4.8 million copies of the 

20 target sequence. The FCR method is described in a number 
of publications, including Saiki etal . , Science (1985) 
230:1350-1354; Saiki et al. , Nature (1986) 324:163-165; 
and Scharf et al., Science (1986) 233:1076-1078. Also 
see U.S. Patent Nos. 4,683,194; 4,683,195; and 

25 4,683,202. 

The invention includes a specific diagnostic 
method for determination of IGFBP-5, based on selective 
amplification of IGFBP-5-encoding DNA fragments. This 
method employs a pair of single-strand primers derived 

30 from non-homologous regions of opposite strands of a DNA 
duplex fragment selected from the sequences set forth in 
Figure 1. These "primer fragments," which form one 
aspect of the invention, are prepared from IGFBP-5 
fragments such as described above. The method follows 

35 the process for amplifying selected nucleic acid 

sequences as disclosed in U.S. Patent No. 4,683,202, as 
discussed above. 
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Monoclonal Antibodies 

For both in vivo use of antibodies to IGFBP-5 
and anti-idiotype antibodies and diagnostic use, it may 
be preferable to use monoclonal antibodies. Monoclonal 
5 anti-virus particle antibodies or anti-idiotype 

antibodies can be produced as follows. The spleen or 
lymphocytes from an immunized animal are removed and 
immortalized or used to prepare hybridomas by methods 
known to those skilled in the art. To produce a human- 

10 human hybridoma, a human lymphocyte donor is selected. 
Epstein-Barr virus (EBV) can be used to immortalize 
human lymphocytes or a human fusion partner can be used 
to produce human-human hybridomas* Primary in vitro 
immunization with peptides can also be used in the 

15 generation of human monoclonal antibodies. Antibodies 
secreted by the immortalized cells are screened to 
determine the clones that secrete antibodies of the 
desired specificity. 



20 Assay for Biological Properties o f IGFBP-5 

The property of binding to an insulin-like 
growth factor is one of the biological activities of 
the proteins of the invention. These proteins may be 
conveniently tested in a binding assay using IGF-I 

25 [Rinderknecht, E. and Humbel , R • E • , J. Biol. Chem. 
(1978) 253 2769] or IGF-II [Rinderknecht , E. and 
Humbel, R.E., FEBS (1978) 89: 283], preferably IGF-II , 
in a labelled, e.g., iodinated form. For example, such 
an assay may conveniently include performing a gel 

30 electroohoresis (SDS-PAGE) of the proteins of the 

invent n, followed by a western blot of the gel, then 

125 

incubating the blot in the presence of [ I] IGF-I or 
II, washing the blot to remove free IGF-I or -II, and 
detecting the radioactivity on the blot. 



Sources of IGFBP-5 

While IGF-BPs of the invention originally 
means human IGF-BPs, IGF-BPs of mammals, e.g., murine, 
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procine, equine or bovine , are included within the 
d f inition of IGF-BPs as long as they comply with the 
required degree of homology. 

The IGF-BPs of the invention include those 
5 purified from a tissue extract or from a conditioned 

culture medium as well as those obtained by recombinant 
means . 

Uses of IGFBP-5 

10 Therapeutic applications of the binding 

proteins of the invention include its use as a single 
therapeutic agent and its use in combination with an 
IGF, the latter use being preferred* 

When used in combination with an IGF, a 

15 binding protein of the invention is suitable for use in 
the indications above mentioned, primarily as a growth 
inducing, tissue regenerating or would healing agent. 
Accordingly, the invention provides: 
i) use of a binding protein of the invention 

20 together with IGF in free or fixed combination for 

stimulating the growth of a subject, tissue or organ 
regeneration or wound healing, or 

ii) a method of stimulating the growth of a 
subject, tissue or organ regeneration or wound healing 

25 in a subject which comprises administering a thera- 
peutically effective amount of a binding protein of the 
invention together with a therapeutically effective 
amount of an IGF to a patient in need of such treatment, 
or 

30 iii) a pharmaceutical composition for 

stimulating the growth of a subject, tissue or organ 
regeneration or wound healing which comprises a binding 
protein of a invention together with an IGF and with a 
pharmaceutically acceptable carrier or diluent, or 

35 iv) a package containing separate until dose 

forms of a binding protein of the invention and an IGF, 
together with instructions for mixing or concomitant 
administration . 
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quantitated on the basis of release of [45]Ca from the 
bone explants. 

C) The IGF-binding proteins of the invention 
as well as other IGF-binding proteins potentiate the 
5 erythropoiet in-like effect of IGF-I. This may be, in 

particular, demonstrated by testing IGF-I, e.g. 10 ng,ml 
IGF-I, alone and in combination with the mature IGF 
binding protein of Figure 1, e.g. a 50 fil aliquot of a 
supernatant derived from a culture of a CHO cell line 

10 expressing the mature IGF binding protein of Figure 1, 
in a CFU-E assay as described in Fagg r B. Roitsch, C.A. 
Cell, Physiol. (1986) 126 :1. Whereas the result 
obtained with IGF-binding protein alone is not signi- 
ficantly different from the control, a synergistic 

15 effect of the combination is seen when compared to IGF-I 
alone . 

Further, the mitogenic activity of an IGF 
combined with a binding protein of the invention may be 
tested as follows: The incorporation of [3h] methyl- 

20 thymidine into CCL 39 cells (Chinese hamster lung 

fibroblasts) in culture is measured as described by 
Plouet et al. Cell. Miol. (1984) 30:105. In this 
assay, cell line CCI 39 is seeded in a plate at 40 000 
cells per well in 0.5 ml MEM culture medium (Gibco) 

25 containing 10% fetal calf serum 0.1% penicillin, 0.4% 
streptomycin and 0.5% fungizone. After 72 hours 
incubation at 37 °C in an atmosphere loaded with 5% C02- 
Cells are washed with MEM medium in the absence of fetal 
call serum and then cultured in his medium for 20 hours. 

30 At this stage, the cell culture is confluent and an IGF 
or a binding protein or both together are inoculated 
each at a dose of 10 ng to 200 ng culture medium. When 
added together the molar ratio must be 1:1. The test 
sample is incubated at 37 °C for 24 hours and then added 

35 with 1/tCi [3h] methylthymidine in lO/il PES. After 4 

hours incubation the incorporation of methylthymidine is 
stopped washing cells with PBS. Cells are fixed with 
0.5 ml trichloroacetic acid (5%) for 30 min. washed with 
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water and finally lys d with 0.5 ml of NaOH 0.1M for 2 
hours at 37 °C 0.5 ml of lysate is transferred into a 
scintillation flask and mixed with 3 ml of 
scintillation liquid for measuring /?-radioactivity. 
5 The binding protein potentiates the mitogenic activity 
of IGF although the radioactivity level that is 
measured when a binding protein is used alone is not 
substantially different from that of the control 
sample . 

10 More particularly a binding protein of the 

invention, in combination with an IGF is useful 
a) for treating hypopituitarism. Laron-type dwarfism, 
osteoporosis, anemias especially complications fol- 
lowing an chronic renal failure and liver or kidney 

15 deficiency and b) for promoting healing of wounds such 
as ulcers and burns or those occurring in accidental 
events or resulting from surgery. 

For use in association with a binding protein 
of the invention. IGF is preferably selected from IGF-I 

20 as described in Rinderknecht , E. and Humbel, R.E., 

J. Biol. Chem. (1978) 253:2769. IGF-II as described in 
Rinderknecht, E. and Humbel, R.E., FEBS (1978) 89:283 
and any derivative or fragment of IGF-I and IGF-II 
having an insulin-like growth factor activity. Most 

25 preferably, this is IGF-II. 

For use in association with an IGF, a binding 
protein of the invention is preferably a protein which 
is from 85% to 100% homologous with pre IGF-BP or IGF-BP 
as shown in Figure 1. 

30 When not associated with IGFs, binding 

proteins of the invention have further therapeutic 
applications in any physiological disorders resulting 
from an excessive production of free IGFsa, e.g. IGF- 
producing cancers such as breast or kidney cancer, 

35 diabetic proliferative retinopathy or abnormal growth 
of tall children with high serum level of free IGF. 

Accordingly, the invention also provides: 
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(i) the use of a binding protein of the 
invention for treating physiological disorders result- 
ing from an excessive production of free IGF by a 
mammalian, for example human body, e.g. IGF-producing 
cancers, diabetic retinopathy or abnormal growth of 
tall subjects, or 

(ii) a method of treating physiological 
disorders resulting from an excessive production of 
free IGF, e.g. IGF-producing cancers, diabetic 
retinopathy or abnormal growth of a subject which 
comprises administering a therapeutically effective 
amount of aa binding protein of the invention to a 
subject in need of such treatment, or 

(iii) a pharmaceutical composition for 
treating physiological disorders resulting from an 
excessive production of free IGF, e.g. IGF-producing 
cancers, diabetic retinopathy or abnormal growth of a 
subject which comprises a binding protein of the 
invention in association with a pharmaceutically 
acceptable carrier or diluent, or 

(iv) a method of delivering IGFs to specific 
organs or tissues based on the differential binding 
properties of IGFBP-5, as indicated by biological 
testing. 

Fragments of mutated forms of the pre-IGF-BP 
or IGF-BP as shown in Figure 1 are of particular value 
for treating the physiological disorders resulting from 
an excessive production of free IGF in the human body. 

A binding protein of the invention, alone or 
in combination with an IGF, may be administered by any 
conventional route suitable for peptides, or particular 
enterally, e.g. in the form of tablets or capsules or, 
preferably parenterally, e.g. subcutaneous ly or 
intravenously in the form of injections of infusions. 
Further, it may be also used topically, e.g. in the 
form of ointments or suspensions when used, e.g. as a 
wound healing agent. 
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For all the above indications the appropriate 
dosage will be cours vary depending upon, for example, 
the nature and severity of the disorder to be treated 
and the mode of administration. For example, 
satisfactory results may be obtained in the treatment 
of osteoporosis or anemia at daily dosages from about 
to 0.1 /tg/kg to 40 /*g/kg body weight, preferably from 
about 0.5 fig/kg to about 20 /*g/kg body weight of a 
binding protein of the invention. In larger mammals, 
for example humans, as indicated daily dosage is from 
about 5 fig conveniently administered parenterally, for 
example once a day. For wound healing, a daily dose of 
from 0.1 to 10 /tg of a protein of the invention per cm2 
wound area is suitably indicated in larger mammals, for 
15 example humans. This is conveniently administered once 
a day. When used in combination with an IGF, the molar 
ratio of the binding protein to IGF is preferably from 
0.1:1 to 5:1, more preferably from 0.5:1 to 2:1, most 
preferably 1:1. 

20 Pharmaceutical compositions of the invention 

may be manufactured in conventional manner. 

Other uses for the binding proteins of the 
invention include various uses in the production of IGF 
molecules by recombinant techniques. The binding 

25 proteins of the invention can be used to detect yeast- 
produced IGF in native (active) conformation (as 
opposed to inactivated forms). Additionally, the 
proteins of the invention can be used as carrier 
(possibly in the form of co-expressed proteins) in the 

30 production of IGF. As the binding protein stabilized 
IGF in vivo , they are expected to do the same in vitro. 
The binding proteins can also be used to purify IGF 
produced in yeast by attaching them to a solid surface 
(such as in affinity chromatography). 

35 While the invention has been described with 

referenc to particular mbodiments, methods, 
construction, and use, it will be apparent to those 
skilled in the art that various changes and modifi- 
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cations can be made without departing from the 
invention. 

Example 1 

5 Sepharose-IGF T affinity column 

Sixty mg recombinant human IGF I (Ciba-Geigy 
AG, Basel, Switzerland) was dissolved in 20 ml 0.1 M 
NaHCOg, pH 8.3, containing 0.5 M NaCl and coupled to 
CNBr-activated Sepharose 48 (4 g dry gel) according 
10 to the protocol of the supplier (Pharmacia Fine 

Chemicals, Uppsala, Sweden). The gel was equilibrated 
with 500 ml 0.05 M sodium phosphate buffer/0.5 M NaCl, 
pH 6.5, in a 1.5 x 15 cm glass column (gel bed volume 
15 ml). 

15 

Purification of serum IGFBPs 

This procedure was carried out according to a 
modification of the procedure of Martin and Baxter 
(18,19). Note that references shown in the examples in 

20 parentheses refer to the references listed by number in 
the Relevant Literature section of this specification. 
One liter of out-dated human citrate plasma was stirred 
for 2 hours at room temperature with 50 U (1 ml) 
thrombin-calcium, filtered through cheesecloth and 

25 acidified. Dissociated IGF was removed with SP-Sephadex 
C-25. The pH was subsequently adjusted to S.5 and the 
precipitate removed by centrifugation at 20,000 rpm for 
30 min. The supernatant was pumped through the 
Sepharose-IGF I affinity column described above at 

30 34 ml/min and the column washed with 500 ml 0.05 M 

sodium phosphate buffer/0.5 M NaCl, pH 6.5. Binding 
protein (i.e., IGFBP) was eluted with 40 ml 0.5 M acetic 
acid, dialyzed 3 times against two liters of 0.1 M 
ammonium acetate, and lyophilized. The lyophilized 

35 material (40 mg) was dissolved in 4 ml 0.1 M 
heptafluorobutyric acid containing 20% (v/v) 
acetonitrile , and the insoluble material was removed by 
centrifugation at 10,000 for 10 min. The clear 
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supernatant was subjected to HPLC (2 runs with 2 ml 
each) on a Nucleosil C 18 column (Macherey-Nagel, D ren, 
FRG) (19). Effluent fractions were dried in a Speed-Vac 
(Savant Instruments, Hicksville, NY) , taken up in 1 ml 
5 0.01 M acetic acid, and dried again. The resulting 
material was dissolved in 250 /il H 2 0 for ligand blot 
analysis (see below) and silver staining (20). 



10 



-^ ^I-IGF ligand blot analysis 

The method of Hossenlopp et al. (21) was used 
with slight modifications (6,19). Five-/tl aliquots of 
the HPLC effluent fractions were subjected to 
electrophoresis on 15% SDS polyacrylamide slab gels 
15 under nonreducing conditions. The 14 C-labeled 

molecular weight marker (Rainbow Marker, Amersham, UK) 
was reduced. The gels were transblotted on 
nitrocellulose membranes and processed as described 
(21). Membranes were incubated for 6 h at room 
20 temperature in a sealed plastic bag with 3 x 10 cpm 
125 I-labeled IGF II (22). After several washes the 
air-dried membranes were exposed for 12-48 h at -70°C 
to an X-ray film (Kodak, X-OMAT, AR) in a Kodak 
X-OMATIC cassette (Eastman, Rochester, NY) . 
25 125 I-IGF II was chosen as the tracer for 

screening because not all of the bands are detected 
with 125 I-IGF I (see results). 



Electroblottincr on polyvinyl idene difluoride 

30 (Immobilion) membrane 

Ten to thirty fig of HPLC-purif ied IGFBP was 
electrophoresed as described above (polyacrylamide slab 
gels 15x15x0.15 cm) under reducing conditions and 
electroblotted (2 h at 0.8 A) on an Immobilon membrane 

35 (Millipore Corp., Bedford, MA) as described by 

Matsudaira (23) . The membrane was stained for 5 sec 
with 0.1% Coomassi Blue R-250 in 50% methanol, 
destained in 50% methanol/10% acetic acid for 5 min at 
room temperature, and then thoroughly rinsed in HjO. 
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The membrane was air-dri d, and the protein bands were 
cut out and stored at -20°C. 

Amino acid analysis was performed by 
automated Edman degradation using an Applied Biosystems 
Model 4 7 OA protein sequencer (Poster City, CA) (25). 

Tissue and RNA isolation 

Human osteosarcoma tissue was obtained from 
Dr. Marshall Urist, UCLA. Total RNA was isolated by 
the guanidinium thiocyanate method (27). An Osterizer 
was used to homogenize the tissue Poly(A) + RNA was 
purified by a single fractionation over oligo (dT) 
cellulose (29) . 

Oligonucleotide synthesis 

Oligonucleotide adaptors, probes, and 
sequencing and PCR primers were synthesized by the 
phosphoramidite method with an Applied Biosystems 
(Foster City, CA) model 380A synthesizer, purified by 
polyacrylamide gel electrophoresis, and desalted on 
SEP-PAK C 18 cartridges (Waters; Milford, MA). 

A 14-mer oligonucleotide (5' CCTGTAGATCT- 
CCG 3') and a 18-mer oligonucleotide (5' AATTCGGAGAT- 
CTACAGG 3') were synthesized and used as the EcoRl 
adaptors for the human osteocarcinoma cDNA library 
constructed in XZAP. The 14-mer was phosphorylated 
(30), then immediately heated to 95 °C for 15 min to 
inactivate the polynucleotide kinase. The adaptors 
also contain an internal Bgl II site and are described 
more fully in the following section describing the 
construction of the cDNA library. 

The two PCR primers for BPS were: (1) a 
"sense- primer consisting of a mixture of 48 26-mers 
[5' AGATCTGAATTCGA ( C/ T ) GA ( A/G ) GCXAT ( A/T/C ) CA 3'] and 
(2) an "antisense" primer consisting of a mixture of 64 
26-mers [5' AGATCTGAATTCGC ( T/C ) A( G/A) ( T/C ) TTXG- 
C(T/C)TC 3'] where X denotes all four deoxynucleotides . 
Eco RI sites were included in the primers to allow for 
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subcloning into M13 sequencing vectors. The RP5 probe 
consisted of a 20-mer (5' TCGGAGCAGGGCGGGCAGTG 3') and a 
7-mer primer (5' CACTGCC 3'). 

5 PCR amplification of BP5 s equences 

The PCR reactions were performed according to 
the supplier of the PCR kit (Perkin/Elmer/Cetus) using 
the PCR primers described (see Oligonucleotide 
Synthesis section) at a final concentration of 8 /M. 

10 The template cDNA was synthesized from 2.5 fig of human 
osteosarcoma (Ost2) poly (A) + RNA. The conditions of 
cDNA synthesis were identical to those described below 
for first strand cDNA synthesis (see Construction of 
cDNA Library) . The cDNA was fractionated on Biogel 

15 A-15m, recovered by ethanol precipitation, and 

resuspended in 100 pi of sterile water. From 2.5 to 
5 pi of cDNA template were used for each PCR reaction. 
Thirty-five cycles of PCR were performed in a 
Perkin/Elmer/Cetus DNA thermocycler . The first 

20 10 cycles consisted of a 94°C, 1 min. denaturation step; 
a 40°C, 1 min. annealing step; and a 40°C, 1 min. 
extension step. The next 25 cycles consisted of a 
94°C, 1 min. denaturation step; a 55°C / 1 min. annealing 
step; and a 72°C, 1 min. extension step. The final 

25 extension step at the last cycle was 7 min. Samples 
were extracted once with phenol/ chloroform/ IAA 
(1:1:0.04), once with chloroform/ IAA (24:1), recovered 
by ethanol precipitation, digested with EcoRI, and 
fractionated by electrophoresis a. a 7% acrylamide, 

30 lx TBE gel (30). DNA migrating between 40-70 b.p. was 
excised from the gel, purified by passage over an 
Elutip-d column, ligated to Eco-RI cut ml3 mol8, and 
introduced into DH5aF' for DNA sequencing. 

35 Construction of the cDNA library 

First-strand cDNA was synth sized from human 
osteosarcoma (0st3) poly (A) + RNA as described (33), but 
with the following modifications: 10 pg of poly(A) RNA 
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was heated to 65 °C for 3 min in 20 /il 5 mM Tris- 
hydrochloride (pH 7.5) , immediately placed on ice for 
1 min, and then adjusted (at room temperature) to 
contain 50 mM Tris-hydrochloride (pH 8.3 at 42°C), 8 mM 
5 MgCl 2 / 30 mM KC1, 10 mM dithiothreitol, 2 mM each of 
dATP, dGTP, dTTP and [a" 32 ] dCTP (300 cpm/pmol), 60 U 
RNasin, and 2.5 fig of oligo (dT) 12 _ 18 . Sixty U of 
cloned moloney murine leukemia virus reverse 
transcriptase was added to initiate cDNA synthesis 

10 (total reaction volume 40 1), and the reaction was 

continued for 60 min at 42°C. The second cDNA strand 
was synthesized and ligated to the EcoRl adaptors (see 
Oligonucleotide Synthesis section) as described (34). 
The dscDNA was phosphorylated (30) and then adjusted to 

15 0.5 M NaCl/25 mM EDTA and heated at 75 °C for 15 min to 
inactivate the polynucleotide kinase. The dscDNA was 
separated from unligated adaptors by chromatography on 
Biogel A-15m and recovered by ethanol precipitation. 
The dscDNA was ligated to EcoRl-cut XZAP (Stratagene) as 

20 described by the supplier, but including 15% poly- 
ethylene glycol (PEG) 8000 (Sigma) in the reaction 
medium, a modification previously described (35). 
The ligated DNA was recovered by centrif ugation 
(12,000 x g), washed with chloroform, dried, resuspended 

25 in 4 /il H 2 <D, and incubated with an in vitro packaging 
extract (Stratagene) according to the supplier. A 
library of 2.3 x 10 independent recombinant clones was 
obtained. Recombinant phages were propagated in E. coli 
BB4 ( Stratagene ) . 

30 

Screening of the cDNA library 

Approximately 300,000 recombinant phages from 
the Ost3 cDNA library were plated (50,000 phages/ 13 7 mm 
dia. plate) in E. coli BB4 and grown for 5-6 hours at 
35 37 °C. The phages were transferred onto nitrocellulose 
filters (Millipore, HATF 137), processed (36), and 
screened with the BP4 or BPS probes. The BP 4 probe was 
labeled with T A polynucleotide kinase and [7~ 32p ] ATP 
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(28) to a specific activity of 1-2 x 10° cpm//*g. The 
BP 5 probe was labeled according to Feinberg and 
Vogelstein (42). The filters were prehybridized for 
1-2 h at 37°C in 5 x SSC (1 SSC = 0.15 M sodium 
5 chloride/ 0.0 15 M sodium citrate, pH 7) 40% formamide, 
5 x Denhardt's solution (1 x Denhardt's solution = 
0.02% polyvinylpyrrolidone/0.02% Ficoll/0.02% bovine 
serum albumin), 10% dextran sulfate, 50 mM sodium 
phosphate, pH 6.8, 1 mM sodium pyrophosphate, 0.1% 

10 NaDodS0 4 , and 50 fig/ml denatured salmon sperm DNA. 
Labeled probe was added to a concentration of 10 6 
cpm/ml, and hybridization was continued overnight at 
37 °C with gentle shaking. The filters were washed in 
2 x SCC/0.1% NaDodS0 4 at 65°C and exposed to Kodak 

15 XAR-2 films with a DuPont Lightning Plus intensifying 
screen overnight at -80 °C. Areas of plaques giving 
duplicate signals were picked, replated, and rescreened 
until pure plagues were obtained. 
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Plasmid Isolation, Subclonincr, and Sequencing 

Bluescript SK(-) containing BPS cDNA inserts 
were released from XZAP by the K13 rescue/excision 
5 protocol described by the supplier (Stratagene) . 

Plasmid DNA was isolated by the alkaline lysis method 
(30). The inserts were excised from the Bluescript SK(- 
) vector by a Bgl II digestion and fractionated by 
agarose gel electrophoresis. Inserts were excised from 

10 the gel and passively eluted for 12 h with gentle 

shaking in 10 mM Tris-hydrochloride pH 7.5, 1 mM EDTA 
(TE) i purified by elutip-D as described by supplier 
(Schleicher and Schuell) r and subcloned into a M13 
sequencing vector (37). DNA sequencing was performed by 

15 the dideoxy chain termination method (38) using M13 

primers as well as specific internal primers . Ambiguous 
regions were resolved using 7-deaza-2-deo3^guanidine- 
triphosphate (39) and sequenase (US Biochemicals ) . 

20 Deposit of Genetic Information 

The genetic sequences set forth in Figure 1 
are on deposit with the American Type Culture 
Collection where they are identified as follows: 

25 Cloned Internal In E. coli ATCC 

Protein Identifier Strain Number 

IGFBP-5 pBsPB5.5 RR1 A M15 58387 
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WE CLAIM: 

1. A recombinant DNA molecule comprising a 
nucleic acid sequence encoding insulin-like-growth- 

5 factor binding protein 5 (IGFBP-5) or a subsequence 
thereof comprising at least 10 nucleotides. 

2. The recombinant DNA molecule of Claim 1, 
wherein said sequence is as set forth for IGFBP-5 in 

10 Figure 1. 

3. The recombinant DNA molecule of Claim 1, 
which comprises a human DNA sequence. 

15 4. The recombinant DNA molecule of Claim l f 

wherein said sequence is a genomic sequence. 

5. The recombinant DNA molecule of Claim 1, 
wherein said sequence is a cDNA sequence. 

20 

6. The recombinant DNA molecule of Claim 1, 
wherein said molecule is contained in pBsBP5.5. 

7 . A recombinant microorganism or cell line 
25 containing the DNA molecule of Claim 1. 

8. The microorganism of Claim 7 , wherein said 
microorganism is a yeast. 

30 9. The cell line of Claim 7, wherein said cell 

line is a CHO cell line. 

10. A method for producing IGFBP-5 or a fragment 
thereof comprising growing a recombinant host 
35 containing the DNA molecule of Claim 1 under conditions 
wherein said IGFBP-5 or fragment is expressed by the 
host and then isolating the expressed IGFBP-5 or 
fragment. 
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11. The method of Claim 10, wherein said host is 
a microorganism. 

5 12. The method of Claim 10, wherein said host is 

a eucaryotic cell. 

13. The method of Claim 10, wherein said host is 
a non-human transgenic animal. 

10 
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FIGURE 1 

DMA and Amino Acid Sequence of Clone Encoding hIGPBP-5 

Met Leu. Pro Leu Cya Leu val Ala Ala 

<^TTCa»»!CSACJtfWGGTC ATG CtC CCC CTC TGC Ctt^GTG GCC GCC 

^30 MA , 

T ,_ Hlfl c-a Pr0 pro era S*r Glu Glu Lya Leu Ala' Arg Cya Arg 
^^ScCGCCC^TCC(MGAGAafiCBiGCGCGCIGCCGC 

©IS vai sit- era Glu Glu Leu val Arg Glu rro Gly cya git 

^SSroGGC^G^G^ClGGIGSSGaGCCSGGCWCGGC 

rv« nr. Ala Thr cvs Ala Leu Gly Leu Gly Met Fro eye Gly val 

T([t TGC GCC ACT TGC GCC CTS GGC XTG GGG ATS CCC TGC GOG GTG 

Jz vf-~ xra era Gly Ser Gly Lau Arg cyr xyr Pro Pro Arg 
SSScll^^^^CTGCKTGCTACCCGCCCCGA 

90 

r1v elu ?ro uu Kls THr Lau Mat Hi* Gly Oln 0I7 v*l 

i*e! riu t^u Ala Glu lie Glu Ala Ila Oln Ola Scr Ler Ola 
TGC AT8 SAG CXG GCG GAG ATC 6M GCC ATC CM GAA AGC CW CM 



•m km cm A*o Olu Gly As? Sia Pzo An Aan Ser Pne Ser 
CCC ICT GAC AM GAC GAG GGT GAC CAC CCC AAC AAC AGC TTC AGC 

,. A 140 

•m er« Ser Ala Ila Aap Arg Arg Cya Leu Gin Lye Hie Phe Ala 

CC?3xi£GaCA?GMcSS«!GCC^ 

150 



t— ti. &m An am Ser Shr S«r Gly Gly Lya Met Lye val Aan 

ICO 170 

b.t« bm &m du Abo Ala Arv Pro Val Pro Gin Cly *•* Cya 

S«c«ccc^cMc«GMcacCTG«cec 

1B0 

ciu i*u Hia Aw Ala Leu Glu Arg Lau Ala Ala Ser Gin 
CMACCGAGCWCWCaGaCTGCJUSCGG 

<i«o 2°° _ 

ser Arg ttr Hia Glu Asp Leu Tyr lie Ili fro lie Pro Aan Cya 

i£c«ACCCACCMGACCTCJACA3ttATCCCCA^ 

210 

aSSSMSSSBSSRSSSSS 
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230 / 
? 2 £ A1« XT9 GlT I»Y« tXP CT» V»l A»p AT* *T» Xh*» fly 

«S «S Sfl WC AM 3c *3 WT 0«J GAC CG<J MO AC* «0 

340 

val lm Leu Pro Gly Gly Lw Glu Pro Ly« Oly Jl« J** J»P Jy? 

GIG AAC CM CCS <WC CCC CTC CAS CCA AAO GGG GAG CTG «AC TGC 

CTCA«arcCCCT<WXACTa^ 
ICACgCTCACTCCWa^TCCCTC C CCC CC ^ 
fiTOTCCCTCTt»MCSTOTOT C T C l Gira ^ 
AGltfKCTQ3G<HlGTTCTCSCTGGTCCTACACACCCCAAGAC^ 

CCCAaGAGGTCTGAGOETGGlGTGTTK^ 

TTCCTTCACTCATCCAGCCACCTAAAAACATTTACTCACCATQXACSACCTCCCAC^ 

AGTrTTCAgCCTTGG<aGGTTTTATICTG^ 
CCTAIAAG6AGASKCAA6<XTGTGSCAG^aGAAAAA 

AAGA<yOICT*CCTTGMCJ^TCCITCC^^ 
AGGAA(XGTCGOTTA<5CA<JATGGAG1!aJI!G3^^ 
CAGAClTGCa^T C (XyiTC XC ^ 
CACCATCT6CTTSGTTG«aGGCAGCTe^^ 

GGGGATGASGGGCTTCCWGGICCTGraCCClAC^^ 

OvCCGGGAlGAAO^TCCrTCC^ 

ATClC«TaXCTACACCiaX^CC30CCTCCCTA^ 

TCXTTTITSZAACAAAACCCIGZM^ICTCOSAAITC 
Translated ml. weignt - 27936.61 



nw- 25978.7.4 f r remainder of prot in 



WO 92/03470 



3/4 
FIGURE 



PCT/US91/06140 



Comparison of Various Human IGPBP Amino Acid Saqu ncet 
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